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We define a compilation scheme for a constructor-based, strongly-sequential, graph rewriting system 
which shortcuts some needed steps. The object code is another constructor-based graph rewriting 
system. This system is normalizing for the original system when using an innermost strategy. Con¬ 
sequently, the object code can be easily implemented by eager functions in a variety of programming 
languages. We modify this object code in a way that avoids total or partial construction of the con¬ 
tractu of some needed steps of a computation. When computing normal forms in this way, both 
memory consumption and execution time are reduced compared to ordinary rewriting computations 
in the original system. 


1 Introduction 

Rewrite systems are models of computations that specify the actions, but not the control. The object of 
a computation is a graph referred to as an expression. The actions are encoded by rules that define how 
to replace (rewrite) one expression with another. The goal of a computation is to reach an expression, 
called a normal form, that cannot be further rewritten. 

In the computation of an expression, a rewrite system does not tell which subexpression should be 
replaced to reach the goal. 

Example 1. Consider the following rewrite system. The syntax is Curry l\17\l . 
loop = loop 

snd (_,y) = y (1) 

A computation of snd (loop,0) terminates with 0 if the second rule 0/(0 is ever applied, but goes on 
forever without making any progress if only the first rule is applied. 

In a computation a strategy is a policy or algorithm that defines both which subexpression should be 
replaced and its replacement. The intended goal of a strategy is to efficiently produce a normal form of 
an expression when it exists. A practical strategy, called needed, is known for the class of the strongly 
sequential term rewriting systems HSl. This strategy relies on the fact that, in every reducible expression 
e, there exists a redex, also called needed, that is reduced in any computation of e to a normal form. 

The needed strategy is defined and implemented as follows: given an expression e, while e is re¬ 
ducible, reduce an arbitrarily chosen, needed redex of e. In the systems considered in this paper, finding 
a needed redex is easy without look-ahead Q. This strategy is normalizing: if an expression e has a 
normal form, repeatedly reducing arbitrary needed redexes will terminate with that normal form. This 
strategy is also optimal in the number of reduced redexes for graph (not term) rewriting. 

The above outline shows that implementing a needed strategy is a relatively straightforward task. 
Surprisingly, however, it is possible to shortcut some of the needed steps in the computation. This paper 
shows how this shortcutting can be introduced into an implementation of a needed strategy. 
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Terminology and background information are recalled in Sect. The compilation scheme and its 
properties are in Sect. and The transformation that allows shortcutting needed redexes, and its 
properties, are in Sect. andSect, [^presents two benchmarks and sketches further opportunities to 
shortcut needed steps. Sect, [^discusses the application of our work to the implementation of functional 
logic languages. Related work and conclusion are in Sect.[^and[^ respectively. 

2 Preliminaries 

A rewrite system is a pair (ZU ^ in which £ = ‘if tt)^ is a signature partitioned into constructors and 

operations (or functions), is a denumerable set of variables, and is a set of rewrite rules defined 
below. Without further mention, we assume that the symbols of the signature have a type, and that any 
expression over the signature is well typed. 

An expression is a single-rooted, directed, acyclic graph defined in fhe customary way ifTTl Def. 2]. 
An expression e is a constructor form (or value) if, and only if, every node of e is labeled by a consfrucfor 
symbol. Consfrucfor forms are normal forms, buf nol vice versa. For example, head{[]) where head is fhe 
usual operation fhaf refurns fhe firsl elemenf of a (non-emply) lisf, is a normal form, buf nol a consfrucfor 
form. In a conslruclor-based sysfem, such expressions are regarded as failures or exceptions ralher fhan 
resulls of compulafions. Likewise, a head-constructor form is an expression whose roof node is labeled 
by a consfrucfor symbol. 

A rule is a graph wilh Iwo roofs abslracfing fhe lefl- and righl-hand sides respeclively. The rules 
follow fhe constructor discipline |[23l . Each rule’s leff-hand side is a pattern, i.e., an operation symbol 
applied fo zero or more expressions consisling of only constructor symbols and variables. Rules are 
lefl-linear, i.e., fhe leff-hand side is a free. 

The objecfs of a compulalion are graphs ralher fhan terms. Sharing some subexpressions of an 
expression is a requirement of functional logic programming miHisiiiii. Incidentally, this sharing 
ensures that needed redexes are never duplicated during a computation. The difference between our 
graphs and ordinary terms concerns only the sharing of subexpressions. 

A computation of an expression t is a possibly infinite sequence 

t — tQ —y t\ — y ... 

such that ti —)■ ti+\ is a rewrite step ifTTl Def. 23]. For all i, ti is a state of the computation of t. 

Given a rewrite system R, an expression of R is an expression over the signature of R. When s is 
a signature symbol and n is a natural number, s/n denotes that n is the arity of s. When t and u are 
expressions and v is a variable, [u / v] is the substitution that maps v to u, and t[u/v\ is the application of 
[u/v] to t. The reflexive closure of fhe rewrite relation is denoted 

Each operalion in St is inductively sequential', lhal is, ils rewrile rules are organized info a hierarchical 
slruclure called a definitional tree |T| which we informally recall below. An example of a definilional free 
is shown in Q. In a definitional free of an operafion /, fhere are up fo 3 kinds of nodes called branch, 
rule and exempt. Each kind confains a paffem of / and ofher items of information depending on fhe kind. 
A rule node wilh pattern 7i confains a rule of / whose leff-hand side is equal to n modulo renaming nodes 
and variables. An exempt node wilh pattern n confains no ofher informafion. There is no rule of / whose 
leff-hand side is equal fo n. A branch node wilh a pattern n confains children fhaf are sublrees of fhe 
definilional free. Af leasl one child is a rule node. The children are oblained by “narrowing” pattern n. 
Eel X be any variable of n, which is called inductive. For each consfrucfor c/m of fhe fype of x, fhere is 
a child whose pattern is oblained from n by inslanlialing x wilh c{x \,. ..x^), where x, is a fresh variable. 
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An operation f jn is inductively sequential fT] if there exists a definitional tree whose root has pattern 
f{xi ,. ..x„), where x,- is a fresh variable, and whose leaves eontain all, and only, the rules of /. A rewrite 
system is inductively sequential if all of its operations are inductively sequential. 

Inductively sequential operations can be thought of as “well designed” with respect to evaluation. 
To compute a needed redex of an expression e rooted by an operation /, match to e the pattern 7t of 
a maximal (deepest in the tree) node A of a definitional tree of /. If N is an exempt node, e has no 
constructor normal form, and the computation can be aborted. If A is a rule node, e is a redex and can be 
reduced by the rule in A. If A is a branch node, let x be the inductive variable of 7t and t the subexpression 
of e to which x is matched. Then, recursively compute a needed redex of t. 

The inductively sequential systems are the intersection na of the strongly sequential systems m 
and the constructor-based systems ll23l . The following notion f8] for inductively sequential systems is 
key to our work. We abuse the word “needed” because we will show that our notion extends the classic 
one idU. Our notion is a binary relation on nodes, or equivalently on the subexpressions rooted by these 
nodes, since they are in a bijection. 

Definition 1. Let R be an inductively sequential system, e an expression of R rooted by a node p, and n 
a node ofe. Node n is needed for e, and similarly is needed for p, if, and only if, in any computation ofe 
to a head-constructor form, the subexpression of e at n is derived to a head-constructor form. A node n 
(and the redex rooted by n, if any) of a state e of a computation in R is needed if, and only if, it is needed 
for some outermost operation-rooted subexpression ofe. 

Our “needed” relation is interesting only when both nodes are labeled by operation symbols. If e is 
an expression whose root node p is labeled by an operation symbol, then p is trivially needed for p. This 
holds whether or not e is a redex and even when e is already a normal form, e.g., head([]). In particular, 
any expression that is not a value has pairs of nodes in the needed relation. Finally, our definition is 
concerned with reaching a head-constructor form, not a normal form. 

Our notion of need generalizes the classic notion ifTSl . Also, since our systems follow the constructor 
discipline ll23l we are not interested in expressions that do not have a value. 

Lemma 1. Let R be an inductively sequential system and e an expression ofR derivable to a value. Ife' 
is an outermost operation-rooted subexpression ofe, and n is both a node needed for e' and the root of a 
redex r, then r is a needed redex ofe in the sense of El. 

Proof. Since e' is an outermost operation-rooted subexpression of e, any node in any path from the root 
of e to the root of e', except for the root of e', is labeled by constructor symbols. Hence, e can be derived 
to a value only if e' is derived to a value and e' can be derived to a value only if e' is derived to a head- 
constructor form. By assumption, in any derivation of e' to a head-constructor form r is derived to a 
head-constructor form, hence it is reduced. Thus, r is a needed redex of e according to El- D 

Lemma 2. Let R be an inductively sequential system, e an expression ofR, ei, e 2 and e^ subexpressions 
of e such that Ui is the root of ei and the label of ni is an operation, for i = 1,2,3. If nj, is needed for n 2 
and n 2 is needed for n\, then n^ is needed for n\. 

Proof. By hypothesis, if e^ is not derived to a constructor-rooted form, e 2 cannot be derived to a 
constructor-rooted form, and if e 2 is not derived to a constructor-rooted form, e\ cannot be derived 
to a constructor-rooted form. Thus, if eo, is not derived to a constructor-rooted form, e\ cannot be derived 
to a constructor-rooted form. □ 
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compile sr 

01 case of 

02 when branch{K,o, then 

03 y ^ ^ compile 

04 output “H(${7r}) =H(${7r[H(7r|o)]4)” 

05 when rule[n,l^r) then 
06 case r of 

07 when operation-rooted then 

08 output “H(${Z}) =H(${r})” 

09 when constructor-rooted then 

10 output “H(${Z}) = ${r}” 

11 when variable then 

12 for each constructor c/n of the sort of r 

13 let Z'—5-/ = (Z —)■ r)[c(ri,.. .r„)/r] 

14 output “H(${Z'}) = ${/}” 

15 output “H(${Z}) =H(${r})” 

16 when exempt[n) then 

17 output “H(${7r}) = abort” 


Figure 1: Procedure compile takes a definitional tree of an operation / of /? and produces the set of 
rules of H that pattern match /-rooted expressions. 

3 Compilation 

For simplicity and abstraction, we present the object code, C^, of as a constructor-based graph rewriting 
system as well. Cr has only two operations called head and norm, and denoted H and N, respectively. 
The constructor symbols of Cr are all, and only, the symbols of R. The rules of Cr have a priority 
established by the textual order. A rule reduces an expression t only if no other preceding rule could 
be applied to reduce t. These semantics are subtle, since t could become reducible by the preceding 
rule only after some internal reduction. However, all our claims about computations in Cr are stated 
for an innermost strategy. In this case, when a rule is applied, no internal reduction is possible, and the 
semantics of the priority are straightforward. 

Operation H is defined piecemeal for each operation of R. Each operafion of R contributes a number 
of rules dispatched by pattern matching with textual priority. The rules of H contributed by an operation 
with definitional tree ^ are generated by the procedure compile defined in Fig.[T] The infenf of H is fo 
fake an expression of R roofed by an operafion and derive an expression of R roofed by a consfrucfor by 
performing only needed sfeps. 

The expression “${x}” embedded in a siring, denofes inlerpolalion as in modem programming lan¬ 
guages, i.e., Ihe argumenf x is replaced by a siring represenfalion of ifs value. The nofalion t[u]p sfands 
for an expression equal fo t, in which Ihe subexpression identified by p is replaced by u. In procedure 
compile, Ihe nofalion is used fo “wrap” an application of H around Ihe subexpression of Ihe pallern 
al o, fhe inductive node. An example is Ihe Iasi rule of Q. The loop al sfalemenf 12 is for collapsing 
rules, i.e., rules whose righf-hand side is a variable. When Ihis variable malches an expression roofed by 
a consfrucfor of R, no furlher application of H is required; Olherwise, H is applied fo Ihe conlraclum. 
Symbol “abort” is nol considered an elemenl of Ihe signalure of Cr. If any redex is reduced fo “abort”. 
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the computation is aborted since it can be proved that the expression object of the computation has no 
constructor normal form. 

Example 2. Consider the rules defining the operation that concatenates two lists, denoted by the infix 
identifier 

□ ++y = y 

(x:xs)++y = x:(xs++y) ^ ' 

The definitional tree of operation “++ ” is pictorially represented below. The only branch node of this 
tree is the root. The inductive variable of this branch, boxed in the representation, is x. The rule nodes of 
this tree are the two leaves. There are no exempt nodes in this tree since operation “++” is completely 
defined. 

S++y 

[]++y (x:xs)++y (3) 


y X:(xs++y) 

Applying procedure compile to this tree produces the following output: 

H ([]++[]) = [] compile line #14 

H( []++(y:ys)) = (y:ys) compile line #14 

H([]++y) = H(y) compile line #15 (4) 

H((x :xs)++y) = x:(xs++y) compile line #10 

H(x++y) = H(H(x)++y) compile line #04 




Operation N of the object code is defined by one rule for each symbol of R. In the following metarules, 
cjm stands for a constructor oiR, f jn stands for an operation of R, and x,- is a fresh variable for every i. 

N(c(xi,...Xm)) =c(N(xi),...N(Xm)) 

N(/(xi,...x„)) = N(H(/(xi,...x„))) 


Example 3. The rules of N for the list constructors and the operation “++” defined earlier are: 

N([]) = [] 

N(x:xs) = N(x):N(xs) (6) 

N(x++y) = N(H(x++y)) 

Definition 2. The rewrite system consisting of the H rules generated by procedure compile for the 
operations ofR and the N rules generated according to (Q/or all the symbols ofR is the object code of 
R and is denoted Cr. 

Example 4. We show the computation of [1] ++ [2] in both R and Cr. We use the desugared notation 
for list expressions to more easily match the patterns of the rules of 

(l:[])++(2:[]) ^ 1: ([]++(2: [])) ^ 1:2: [] (7) 


and 
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N((l: [])++(2: [])) 

^ N(H((1: [])++(2: []))) 

^ N(l:([]++(2: [])) 

^ N(l):N([]++(2: [])) 

^ l:N(H([]++(2: []))) 

^ 1:N(2:[]) 

^ 1:N(2):N([]) 

^ 1 : 2 : [] 

Computation ([^ is longer than Q. If all the occurrences o/N and H are ” erased” from the states o/® 
a concept formalized shortly, and repeated states of the computation are removed, the remaining steps 
are the same as in The introduction and removal of occurrences o/N and H in ®, which lengthen 
the computation, represent the control, what to rewrite and when to stop. These activities occur in ® 
too, but are in the mind of the reader rather than explicitly represented in the computation. 


4 Compilation Properties 

Cr, the object code of R, correctly implements R. Computations performed by Cr produce the results of 
corresponding computations in R as formalized below. Furthermore, Cr implements a needed strategy, 
because every reduction performed by Cr is a needed reduction in R. In this section, we prove these 
properties of the object code. 

Let Expr be the set of expressions over the signature of Cr output by compile on input a rewrite 
system R. The erasure function S’ : Expr —)> Expr is inductively defined by: 

S{H{t)) = S{t) 

S{N{t)) = S{t) ( 9 ) 

S{s{ti,...tn)) =s{S{ti),...S{tn)) for s/neZR 

Intuitively, the erasure of an expression t removes all the occurrences of H and N from t. The result is an 
expression over the signature of R. 

Lemma 3. Let R be an inductively sequential system and H the head function ofCR. Eor any operation- 
rooted expression t ofR, H(t) is a redex. 

Proof. Let f jn be the root of t, and ST the definitional tree of / input to procedure compile. The pattern 
at the root of ST is f{x\ ,...x„), where each x,- is a variable. Procedure compile outputs a rule of H with 
left-hand side H(/(xi,.. .x„)). Hence this rule, or a more specific one, reduces t. □ 

Comparing graphs modulo a renaming of nodes, as in fhe nexf proof, is a sfandard fechnique |[m due fo 
fhe facl fhaf any node created by a rewrite is fresh. 

Lemma 4. Let R be an inductively sequential system and H the head function of Cr. Let t be an 
operation-rooted expression of R, and H(t) be reduced by a step resulting from the application of a 
rule r originating from statement 04 of procedure compile. The argument of the inner application o/H 
in the contractum is both operation-rooted and needed for t. 

Proof. Let be a definitional tree of the root of t. Let n be the pattern of the branch node n of ST from 
which rule r originates and let o be the inductive node of n. Since r rewrites t and 7i is the left-hand 
side of r modulo a renaming of variables and nodes, there exists a graph homomorphism a such that 
t = o{n). Our convention on the specificity of the rules defining H esfablishes fhaf no rule fexfually 
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preceding r in the definition of H rewrites t. Since procedure compile traverses ^ in post-order, every 
rule of H originating from a node descendant of n in textually precedes r in the definition of H. Let 
q = a(7r|o). For each constructor symbol c/n of /? of the sort of Tt\o, there is a rule of H with argument 
Tt[c{x\,...,Xn)\\o, where ,... are fresh variables, and this rule textually precedes r in the definition 
of H. Therefore, the label of q is not a constructor symbol, otherwise this rule would be applied to t 
instead of r. Since the step of t is innermost, q cannot be labeled by H either. Thus, the only remaining 
possibility is that q is labeled by an operation. We now prove that q is needed for t. If n\ and n 2 are disjoint 
nodes (neither is an ancestor of the other) of then the patterns of n\ and n 2 are not unifiable. This 
is because they have different constructors symbols at the node of the inductive variable of the closest 
(deepest) common ancestor. Thus, since t = o{lt), only a rule of R stored in a rule node of ^ below n can 
rewrite (a descendant of) t at the root, if any such a rule exists. All these rules have a constructor symbol 
at the node matched by o, whereas t has an operation symbol at q, the node matched by o. Therefore, t 
cannot be reduced (hence reduced to a head-constructor form) unless is reduced to a head-constructor 
form. Thus, q is needed for t. □ 

Example 5. The situation depicted by the previous lemma can be seen in the evaluation of 
t= ([1]++[2])++[3]. Accorr/mg to Q, H(t) —)■ H(H( [1]++[2])++[3]). The argument of the in¬ 
ner application o/H is both operation-rooted and needed for t. 

Lemma 5. Let R be an inductively sequential system and H the head function of Cr. Let t be an 
operation-rooted expression of R and let A denote an innermost finite or infinite computation H(t) = 
cq —>■ ei —>■ ... in Cr. 

1. For every index i in A, ^(ei) -A S‘{ei+\) in R. 

1. If A terminates (it neither aborts nor is infinite) in an expression u, then u is a head-constructor 
form of R. 

Proof. Claim 1: Let Z —r be the rule of H applied in the step a —c/+i. There are 3 cases for the origin 
of Z —)■ r. If Z —)• r originates from statement 04 of compile, then (F(ei) = (f (e;+i) and the claim holds. 
Otherwise Z —?■ r originates from one of statements 08, 10, 14 or 15. In all these cases, a subexpression 
of ei of the form H(w) is replaced by either H(m) (statements 08 and 15) or u (statements 10 and 14), 
in which w is an instance of the left-hand side of a rule of R and u is the corresponding right-hand side. 
Thus, in this case too, the claim holds. 

Claim 2: If A aborts or does not terminate, the claim is vacuously true. So, consider the last step of A. 
This step cannot originate from the application of a rule that places H at the root of the contractum, since 
another step would become available. Hence the rule of the last step is generated by statement 10 or 14 
of procedure compile. In both cases, the contractum is a head constructor form. □ 

If A denotes a computation H(f) = cq —?■••• in Cr, then, by Lemmaj^ we denote <S’(eo) -A <F(ei) ^ 

... with <^(A) and—with a slight abuse—we regard it as a computation in R. Some expression of ^(A) 
may be a repetition of the previous one, rather than the result of a rewrite step. However, it is more 
practical to silently ignore these duplicates than filtering them out at the expenses of a more complicated 
definition. We will be careful to avoid an infinite repetition of the same expression. We extend the above 
viewpoint to computations of N(f) in Cr, where t is any expression of R. 

Theorem 1, Let R be an inductively sequential system and H the head function of Cr. Let t be an 
operation-rooted expression of R and let A denote an innermost finite or infinite computation H(t) = 
CO —)• Cl Zn Cr. Every step of S’(A) is needed. 
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Proof. We prove that for every index i such that Cj is a state of A, every argument of an application of 
H in Ci is needed for S’{ei). Preliminarily, we define a relation “A” on the nodes of the states of S’{A) as 
follows. Let p and q be nodes of states <o{ei) and S{ej) of A’(A) respectively. We define p A ^ iff / < 7 or 
i = j and fhe expression af <7 is a proper subexpression of fhe expression af p in (S’{ei). Relafion is a 
well-founded ordering wifh minimum elemenf fhe roof of t. The proof of fhe fheorem is by induction on 
“a”. Base case: Direcfly from fhe definition of “need”, since t is roofed by an operation of R. Induction 
case: Lef q be fhe roof of fhe argumenf of an applicafion of H in ej for j > 0. We disfinguish whefher q 
is fhe roof of fhe argumenf of an applicafion of H in ej- 1 . If if is, fhen fhe claim is a direcf consequence 
of fhe inducfion hypofhesis. If if is nof, ej^\ —)• Cj is an applicafion of a rule r generafed by one of fhe 
stafemenfs 04, 08 or 15 of procedure compile. For sfafemenf 04, fhere is a node p of <^{ej) fhat by fhe 
induction hypofhesis is needed for S’{ej) and mafches fhe paffern n of fhe branch node of a definitional 
free from which rule r originafes. Lef q be fhe node of fhe subexpression of Cj roofed by p mafched by 
K af o. By Lemmaj^ q is needed for p. Since p is needed for S‘{ej), by Lemma[^ q is needed for 
and fhe claim holds. For sfafemenfs 08 and 15, q is fhe roof of fhe confracfum of fhe redex mafched by r 
which by fhe inducfion hypofhesis is needed for S{ej-\). Node q is still labeled by an operation, hence 
if is needed for S(ej) direcfly by fhe definition of “need”. □ 

Corollary 1. Let R be an inductively sequential system. Let t be an expression ofR and let A denote an 
innermost finite or infinite computation N(f) = cq ^ e\ ^ ... in Cr. Every step of (E{A) is needed. 

Proof. Operafion N of Cr applied fo an expression t of /? applies operafion H fo every oufermosf 
operation-roofed subexpression of t. All fhese expressions are needed by Def.[T] The claim is fherefore a 
direcf consequence of Th. [T] □ 

Corollary 2. Let R be an inductively sequential system. For all expressions t and constructor forms u of 
R, t u in R if and only if'H{t )^u in Cr modulo a renaming of nodes. 

Proof. Lef A denofe some innermosf compufafion of N(t). Observe fhaf if A ferminafes in a consfrucfor 
form u of R, fhen every innermosf compufafion of N(f) ferminafes in u because fhe order of fhe reducfions 
is irrelevanf. Therefore, we consider whefher A ferminafes normally. Case 1: A ferminafes normally. If 
N(t) A u, fhen by Lemma|^ poinf 1 , t A u. Case 2: A does nof terminate normally. We consider whefher 
A aborfs. Case 2a: A aborfs. Suppose N(t) = eo —)• e,, and fhe step of e, reduces a redex 

r fo “aborf”. By Theorem [T] r is needed for e,, buf fhere is no rule in R fhaf reduces r, hence t has 
no consfrucfor form. Case 2b: A does nof ferminafes. Every step of (E{A) is needed. The complete free 
unraveling |l9l Def. 13.2.9] of fhe rules of R and fhe stafes of (o’(A), gives an orthogonal term rewriting 
system and a compufafion of fhe unraveled t. Since redexes are innermosf, in fhis compufafion an infinite 
number of needed redexes are reduced. The hypernormalizafion of fhe needed sfrafegy 0 Secf. 9.2.2] 
shows fhaf hence t has no consfrucfor form. □ 

The objecf code Cr for a rewrife sysfem R is subjectively simple. Since innermosf reducfions suffice for 
fhe execufion, operafions H and N can be coded as funcfions fhaf fake fheir argument by-value. This is 
efficient in most programming languages. Corollary in conjunction with Theorem[TJ shows that Cr is 
a good object code: it produces the value of an expression t when t has such value, and it produces this 
value making only steps that must be made by any rewrite computation. One could infer that there cannot 
be a substantially better object code, but this is not true. The next section discusses why. 
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5 Transformation 

We transform the object code to avoid totally, or partially, constructing certain contracta. The transfor¬ 
mation consists of two phases. 

The first phase replaces certain rules of H. Let r be a rule of H in which H is recursively applied to a 
variable, say x, as in the third rule of Q. Rule r is replaced by the set Sr of rules obtained as follows. A 
rule rf is in Sr, iff is obtained from r by instantiating x with f(xi,.. .x„), where f/n is an operation of 
R,xi,...x„ are fresh variables, and the sorts of /(r i,... ) and x are the same. If a rule in Sr still applies 

H to another variable, it is again replaced in the same way. 

Example 6. The following rule originates from instantiating yfor “++” in the third rule 0/(0- 

H([]++(u++v)) = H(u++v) (10) 

The first phase of the transformation ensures that H is always applied to an expression rooted by some 
operation / of R. The second phase introduces, for each operation / of R, a new operation, denoted Hy. 
This operation is the composition of H with /, and then replaces every occurrence of the composition of 
H with / with Hy. 

Example 7. The second phase transforms ( |iOp into: 

H++([],u++v) = H++(u,v) (11) 


After the second phase, operation H can be eliminated from the object code since it is no longer invoked. 
We denote the transformed Cr with Tr and the outcome of the first phase on Cr with Cjj. The mapping T, 
from expressions of Cr to expressions of Tr, formally defines fhe fransformafion: 

rHy(T(ti),...T(t„)), ift = H(/(ti,...t„)); 

T(f) = < 5(T(fi),.. .T(f„)), if f = ^(fi,.. .t„), wifh 5 symbol of R; (12) 

[v, if t = V, wifh V variable. 

Tr is more efficienl fhan Cr because, for any operafion / of R, fhe applicafion of Hy avoids fhe allocation 
of a node labeled by /. This node is also likely fo be paffern mafched later. 

Example 8. Consider the usual length—of—a—list operation: 
length [] = 0 

length (_:xs) = 1+length xs 

The compilation o/® where we omit rules irrelevant to the point we are making, produces: 
H(length( [])) = 0 

H(length(_ :xs)) = H(l+length(xs)) (14) 

The transformation o/® where again we omit rules irrelevant to the point we are making, produces: 
Hlength ( [] ) ~ 0 

Hlength(-:xs)) = H+(1, length(xs) ) (15) 

Below, we show the traces of a portion of the computations of N (length [7]) executed by Cr (left) and 
Tr (right), where the number 7 is an irrelevant value. The rules of “+” are not shown. Intuitively, they 
evaluate the arguments to numbers, and then perform the addition. 
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N(H(length [7]) 

—)■ N(H(l+length []) 
^N(H(1+H(length [])) 
^N(H(l+0)) 

^N(l) 

-> 1 


N(Hiength([7]) 

^N(H+(1, length []) 

^N(H+(l,Hiength([]))) 

^N(H+(1,0)) 

^N(l) 

1 


Cr constructs the expression rooted by the underlined occurrence of “+ ”, and later pattern matches it. 
The same expression is neither constructed nor pattern matched by Tr. 


The transformation increases the size of a program. Certain rules are replaced by sets of rules. The 
number of rules in a replacing set is the number of constructors of some type. A coarse upper bound 
of the size of the transformed program is a constant factor of the size of the original program. Modern 
computers have gigabytes of memory. We believe that the growth in size could become a problem only 
in extreme cases, and likely would not be the most serious problem in those cases. 


6 Transformation Properties 

We show that both phases of the transformation described in the previous section preserve the object 
code computations. 

Lemma 6. Let R be an inductively sequential system. Every step of Cr is a step of C'j^ and vice versa, 
modulo a renaming of nodes. 

Proof. Every rule of Cjj is an instance of a rule of Cr. Hence every step of Cjj is a step of Cr. For the 
converse, let t — )■ m be a step of Cr where some rule r is applied. It suffices to consider the case in which 
t is the redex and the rule r applied in the step is not in Cjj. Let v be the variable “wrapped” by H in 
r. Rule r is output by statement either 04 or 15 of procedure compile. We show that in both cases the 
match of v, say s, is an operation-rooted subexpression of t. If r is output by statement 04, this property 
is ensured by LemmaIf r is output by statement 15, and the match of v were constructor-rooted, then 
some rule output by statement 14 of procedure compile, which textually precedes r and is tried first, 
would match t. Therefore, let //n be the root of s. By the definition of phase 1 of the transformation, rule 
r[f{x \,.. .Xn)/v] is in Cjj. Therefore, modulo a renaming of nodes, t —)• m in Cjj □ 

Corollary 3. Let R be an inductively sequential system. For every operation-rooted expression t and 
head-constructor form u of R, H(t) ^ u in C'j^ if, and only if, T(H(f)) ^ u inTR modulo a renaming of 
nodes. 

Proof. Preliminarily, we show that for any s, H(t) —)• 5 in iff T(H(t)) —)■ x{s) in Tr. Assume H(t) —)• s 
in Cjj. There exists a rule Z —)■ r of and a match (graph homomorphism) a such that H(f) = a(Z) and 
s = cj(r). From the definition of phase 2 of the transformation, t(Z) — z{r) is a rule of Tr. We show 
that this rule reduces T(H(t)) to t( 5). Since T is the identity on variables, and a is the identity on non 
variables, ao t = to a. Thus T(H(t)) = T(a(Z)) = a(T(Z)) —> a(T(r)) = T(a(r)) = t( 5). The converse 
is similar because there a bijection between the steps of Cjj and Tr. 

Now, we prove the main claim. First, the claim just proved holds also when H(f) is in a context. Then, 
an induction on the length of H(t) ^ n in Cjj shows that T(H(t)) ^ t(m) in Tr. Since by assumption u is 
an expression of R, by the definition of T, t(m) =u. □ 
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Finally, we prove that object code and transformed object code execute the same computations. 

Theorem 2. Let R be an inductively sequential system. For all expressions t and u ofR, N(t) A m in Cr 
if, and only if, N(t) A- u in Tr. 

Proof In the computation of N(t) in Cr, by the definition of T, each computation of H(5) in Cr, for 
some expression 5 , is transformed into a computation of t(H( 5)) in Tr. By Lemma|^ the former ends in 
a head-constructor form of R. Hence, by Corollary|^ t(H( 5)) ends in the same head-constructor form of 
R. Thus, N{t) ^ u in Tr produces the same result. The converse is similar. □ 

7 Benchmarking 

Our benchmarks use integer values. To accommodate a built-in integer in a graph node, we define a kind 
of node whose label is a builf-in infeger rafher fhan a signafure symbol. An arifhmefic operation, such as 
addifion, refrieves fhe infegers labeling ifs argumenf nodes, adds fhem fogefher, and allocafes a new node 
labeled by fhe resulf of fhe addifion. 

Our firsf benchmark evaluafes length (Z 1 ++Z 2 ), where length is fhe operation defined in ( [T3| ). In 
fhe fable below, we compare fhe same rewrifing compufafion execufed by Cr and Tr. We measure fhe 
number of rewrife and shorfcuf sfeps execufed, fhe number of nodes allocafed, and fhe number of node 
labels compared by paffern mafching. The rafio befween fhe execufion fimes of Tr and Cr varies wifh fhe 
implemenfafion language, fhe order of execufion of some insfrucfions, and ofher code defails fhaf would 
seem irrelevanf fo fhe work being performed. Therefore, we measure quantifies fhaf are language and 
code independenf. The fabular enfries are in unifs per 10 rewrife sfeps of Cr, and are consfanf funcfions 
of fhis value excepf for very shorf lisfs. For lisfs of one million elemenfs, fhe number of rewrife sfeps of 
Cr is fwo million. 


length {I 1 ++I 2 ) 

Cr 

Tr 

Or 

rewrife sfeps 

10 

6 

6 

shorfcuf sfeps 

0 

4 

4 

node allocafions 

20 

16 

12 

node mafches 

40 

26 

18 


The column labeled Or refers fo objecf code fhaf further shorfcufs needed sfeps using fhe same idea 
behind fhe fransformafion. For example, in fhe second rule of ( [T4| ), bofh argumenfs of fhe addifion in fhe 
righf-hand side are needed. This informafion is known af compile-fime, fherefore fhe compiler can wrap 
an applicafion of H around fhe righf operand of “+” in fhe righf-hand side of fhe rule. 

H(length(_ :xs)) = H(l+H(length(xs))) ( 16 ) 

The composition of H wifh length is replaced by Hiength during fhe second phase. The resulting rule 
is: 

^length (-• ~ H+(1 (xs) ) ( 17 ) 

Of course, fhere is no need fo allocafe a node for expression 1, fhe firsf argumenf of fhe addifion, every 
time rule ( [TS] ) or (dll' is applied. A single node can be shared by fhe entire compufafion. However, since 
fhe firsf argumenf of fhe applicafion of H+ is consfanf, fhis applicafion can be specialized or parfially 
evaluafed as follows: 
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^length (-• (xs) ) 

The application of rule ( [TS] ) allocates no node of the contractum. In our benchmarks, we ignore any 
optimization that is not directly related to shortcutting. Thus Cr, Tr and Or needlessly allocate this node 
every time these rules are applied. 

The number of shortcut steps of Tr and Or remain the same because, loosely speaking. Or shortcuts 
a step that was already shortcut by Tr, but the number of nodes allocated and matched further decreases. 
The effectiveness of Tr to reduce node allocations or pattern matching with respect to Cr varies with the 
program and the computation. 

Our second benchmark computes the n-th Fibonacci number for a relatively large value of n. The 
program we compile is: 
fib 0 = 0 

fib 1 = 1 (19) 

fib n = fib (n-1) + fib (n-2) 

To keep the example simple, we assume that pattern matching is performed by scanning the rules in 
textual order. Therefore, the last rule is applied only when the argument of f ib is neither 0 nor 1. 


fib{n) 

Cr 

Tr 

Or 

rewrite steps 

10 

8 

8 

shorfcuf sfeps 

0 

2 

2 

node allocafions 

24 

22 

10 

node mafches 

44 

26 

16 


The tabular entries are in units per 10 rewrite steps of Cr and are constant functions of this value except 
for very small arguments of fib. For n = 32, the number of steps of Cr is about 17.5 million. With 
respect to Cr, Tr avoids the construction of the root of the right-hand side of the third rule of ( [T^ . Or 
transforms the right-hand side of this rule into: 


H+(Hfib(H-(n,l)),Hfib(H-(n,2))) (20) 

since every node that is not labeled by the variable or the constants 1 and 2 is needed. In this benchmark, 
there is also no need to allocate a node for either 1 or 2 every time (201 is constructed/executed. With 
this further optimization, the step would allocate no new node for the contractum, and the relative gains 
of our approach would be even more striking. 


8 Functional Logic Programming 

Our work is motivated by the implementation of functional logic languages. The graph rewriting systems 
modeling functional logic programs are a superset of the inductively sequential ones. A minimal exten¬ 
sion consists of a single binary operation, called choice, denoted by the infix symbol “?”. An expression 
X ? y reduces non-deterministically to x or y. There are approaches lH |6l for rewriting computations 
involving the choice operation that produce all the values of an expression without ever making a non- 
deterministic choice. These approaches are ideal candidates to host our compilation scheme. 

Popular functional logic languages allow variables, called extra variables, which occur in the right- 
hand side of a rewrite rule, but not in the left-hand side. Computations with extra variables are executed 
by narrowing instead of rewriting. Narrowing simplifies encoding cerfain programming problems info 
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programs 14]. Since our object code selects rules in textual order, and instantiates some variables of the 
rewrite system, narrowing with our object code is not straightforward. However, there is a technique Q 
that transforms a rewrite system with extra variables into an equivalent system without extra variables. 
Loosely speaking, “equivalent”, in this context, means a system with the same input/output relation. In 
conjunction with this technique, our compiler generates code suitable for narrowing computations. 

9 Related Work 

The redexes that we reduce are needed to obtain a constructor-rooted expression, therefoer they are 
closely related to the notion of root-neededness of l[22]l . However, we are interested only in normal forms 
that are constructor forms. In contrast to a computation according to our object code may abort the 
computation of an expression e if no constructor normal form of e is reachable, even if e has a needed 
redex. This is a very desirable property in our intended domain of application since it saves useless 
rewrite steps, and in some cases may lead to the termination of an infinite computation. 

Machines for graph reduction have been proposed ifTOl |20l for the implementation of functional 
languages. While there is a commonality of intent, these efforts differ from ours in two fundamental 
aspects. Our object code is easily translated into a low-level language like C or assembly, whereas these 
machines have instructions that resemble those of an interpreter. There is no explicitly notion of need 
in the computations performed by these machines. Optimizations of these machines are directed toward 
their internal instructions, rather than the needed steps of a computation by rewriting, a problem less 
dependent on any particular mechanism used to compute a normal form. 

Our compilation scheme has similaties with deforestation Il24l . but is complementary to it. Both 
anticipate rule application, to avoid the construction of expressions that would be quickly taken apart 
and disposed. This occurs when a function producing one of these expressions is nested within a function 
consuming the expression. However, our expressions are operation-rooted whereas in deforestation they 
are constructor-rooted. These techniques can be used independently of each other and jointly in the same 
program. 

A compilation scheme similar to ours is described in [21. This effort makes no claims of correctness, 
of executing only needed steps and of shortcutting needed steps. Transformations of rewrite systems for 
compilation purposes are described in |[T2l[l9l . These efforts are more operational than ours. A compi¬ 
lation with the same intent as ours is described in O. The compilation scheme is different. This effort 
does not claim to execute only needed steps, though it shortcuts some of them. Shortcutting is obtained 
by defining ad-hoc functions whereas we present a formal systematic way through specializations of the 
head function. 

10 Conclusion 

Our work addresses rewriting computations for the implementation of functional logic languages. We 
presented two major results. 

The first result is a compilation scheme for inductively sequential graph rewriting systems. The object 
code generated by our scheme has very desirable properties: it is simple consisting of only two functions 
that take arguments by value, it is theoretically efficient by only executing needed steps, and it is complete 
in that it produces the value, when it exists, of any expression. The two functions of the object code are 
easily generated from the signature of the rewrite system and a traversal of the definitional trees of its 
operations. 
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The second result is a transformation of the object code that shortcuts some rewrite steps. Shortcut¬ 
ting avoids partial or total construction of the contractum of a step by composing one function of the 
object code with one operation symbol of the rewrite system signature. This avoids the construction of a 
node and in some cases and its subsequent pattern matching. Benchmarks show that the savings in node 
allocation and matching can be substantial. 

Future work will rigorously investigate the extension of our compilation technique to rewrite systems 
with the choice operation and extra variables, as discussed in Sect. as well as systematic opportunities 
to shortcut needed steps in situations similar to that discussed in Sect. [7] 
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